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Abstract 

FLOWERING LOCUS T (FT) is the key flowering integrator in Arabidopsis (Arabidopsis thaliana), and its homologs encode 
florigens in many plant species regardless of their photoperiodic response. Two FT homologs, GmFT2a and GmFT5a, are 
involved in photoperiod-regulated flowering and coordinately control flowering in soybean. However, the molecular and 
genetic understanding of the roles played by GmFT2a and GmFT5a in photoperiod-regulated flowering in soybean is very 
limited. In this study, we demonstrated that GmFT2a and GmFT5a were able to promote early flowering in soybean by 
overexpressing these two genes in the soybean cultivar Williams 82 under noninductive long-day (LD) conditions. The 
soybean homologs of several floral identity genes, such as GmAPl, GmSOCl and GmLFY, were significantly upregulated by 
GmFT2a and GmFT5a in a redundant and differential pattern. A bZIP transcription factor, GmFDL19, was identified as 
interacting with both GmFT2a and GmFT5a, and this interaction was confirmed by yeast two-hybridization and bimolecular 
fluorescence complementation (BiFC). The overexpression of GmFDL19 in soybean caused early flowering, and the 
transcription levels of the flowering identity genes were also upregulated by GmFDL19, as was consistent with the 
upregulation of GmFT2a and GmFT5a. The transcription of GmFDL19 was also induced by GmFT2a. The results of the 
electrophoretic mobility shift assay (EMSA) indicated that GmFDL19 was able to bind with the cis-elements in the promoter 
of GmAPla. Taken together, our results suggest that GmFT2a and GmFT5a redundantly and differentially control 
photoperiod-regulated flowering in soybean through both physical interaction with and transcriptional upregulation of the 
bZIP transcription factor GmFDL19, thereby inducing the expression of floral identity genes. 
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Introduction 

Plants integrate various environmental signals, such as photo- 
period and temperature, to ensure flowering under those 
conditions that optimize seed production. In Arabidopsis thaliana 
(Arabidopsis), multiple pathways converge on a small number of 
floral integrator genes, which include the floral promoters 
FLOWERING LOCUS T (FT) and TWIN SISTER OF FT (TSF), 
to integrate photoperiod, temperature, vernalization, and light 
quality signaling [1]. FT and TSF are members of a family of 
proteins similar to the mammalian phosphatidylethanolamine- 
binding domain protein (PEBP) [2], [3]. In addition to the FT-like 
proteins, the plant PEBP family consists of two other phylogenet- 



ically distinct groups of proteins, the TERMINAL FLOWER 1 
(TFLl)-like proteins and the MOTHER OF FT AND TFL 
(MFT)-like proteins [4] -[8]. FT and TSF act redundandy to 
promote flowering under long-day (LD) photoperiods [7] , [9], [10]. 
Arabidopsis FT and TSF proteins produced in the phloem [7], 
[1 1] and are transported to the shoot apex, where they dimerize 
with the bZIP transcription factor FD to activate the expression of 
SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1 [SOC1) 
[9], [12] and the floral meristem identity genes APETALA1 [API) 
and LEAFY (LET) [13], [14]. FT-like proteins from various species 
function in a manner similar to that of FT regarding the induction 
of flowering, transport in the phloem, and interaction with FD-like 
proteins [15] — [18], suggesting that this general mechanism is likely 
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widely conserved across flowering plants. However, the rice FT 
ortholog Hd3a interacts with OsFDl indirectly through a 14-3-3 
protein to form a ternary trimer known as the florigen activation 
complex in the nuclei of the shoot apex, where it activates the 
expression of O.sMADSIS, an API homolog that regulates 
flowering [19], [20]. 

Soybean, Glycine max (L.) Merr., is basically a short-day (SD) 
plant: its flowering is induced when the day length becomes 
shorter than a critical length. Soybean is grown at a wide range of 
latitudes, from at least North 50° to South 35°, although the 
cultivation area of each soybean cultivar is restricted to a very 
narrow range of latitudes. This wide adaptability has most likely 
been generated by genetic diversity at a large number of the major 
genes and quantitative trait loci that control flowering behavior. 
Nine major genes, El to E8 and J, that control flowering time and 
maturity have been previously identified in soybean [21]— [28] . 
Among these genes, El has been cloned using a map-based 
approach and is assumed to be a legume-specific transcription 
factor containing a putative nuclear localization signal and a B3 
distandy related domain [29]; E2 has been identified as an 
ortholog of the Arabidopsis GIGANTEA gene [30]; and E3 and E4 
have been confirmed as PHYA homologs using map-based cloning 
[31] and a candidate gene approach [32], respectively. Many 
allelic variations occur at the El, E3 and E4 loci, and their allelic 
combinations condition soybean flowering time, regulate preflow- 
ering and postflowering photoperiod responses, and contribute 
greatly to the wide adaptability of soybean [33], [34]. Two FT 
homologs, GmFT2a and GmFT5a, are involved in the transition to 
flowering and these two genes coordinately control flowering in 
soybean [35]. The maturity genes El, E2, E3 and E4 downreg- 
ulate GmFT2a and GmFT5a expression to delay flowering and 
maturation under LD conditions, suggesting that GmFT2a and 
GmFT5a are the soybean flowering integrators and major targets 
in the control of flowering [29], [30], [35], [36]. In addition, two 
SOC1 homologs, GmSOCl and GmSOCTlike, have been molecular 
characterized in soybean: the overexpression of GmSOCl partially 
rescued the late-flowering phenotype of the socl-1 Arabidopsis 
mutant under LD conditions [37], while the overexpression of 
GmSOCl -like promoted flowering in Lotus corniculatus [38]. These 
results suggest that the two soybean SOC1 homologs may function 
as floral activators in soybean. A soybean API homolog, GmAPl, 
has also been isolated in soybean and is specifically expressed in 
the flower, especially in the sepals and petals. This gene caused 
early flowering and the alteration of floral organ patterns when 
ectopically expressed in tobacco [39]. Despite the economic 
importance of soybean, knowledge regarding its molecular 
mechanisms of flowering remains limited. Here, we report that 
the overexpression of GmFT2a and GmFT5a in soybean can 
promote early flowering by activating the expression of floral 
identity gene homologs such as GmAPl, GmLFT and GmSOCl. The 
GmFT2a and GmFTSa proteins interact with the bZIP transcrip- 
tion factor GmFDL19. The overexpression of GmFDL19 in 
soybean can also induce the expression of floral identity genes. 
Additionally, we show that the bZIP transcription factor 
GmFDL19 is able to bind with the ACGT cis-element of the 
GmAPl promoter. Our results suggest that the putative flowering 
model FT/FD-AP1 is well conserved in the legume soybean and 
that GmFDL 1 9 may act as the key component in the photoperiod- 
regulated flowering pathway controlled by GmFT2a and 
GmFT5a. 



Materials and Methods 

Plant materials and growth conditions 

The soybean cultivars Harosoy, Williams 82 and Dongnong 50 
were used in this study. All plants were grown in a growth 
chamber (Conviron ADAPTIS-A1000, Canada) at a consistent 
temperature of 25°C and an average photon flux of 300 umol 
m _2 s _1 , supplied by T5 fluorescent lamps. Day length regimes 
were 12L/12D for SD and 16L/8D or 18L/6D for LD. Tissue- 
specific expression was analyzed using the cultivar Harosoy grown 
under SD. Total RNA was isolated from trifoliate leaves, shoot 
apices, roots, flowers, flower buds, and roots. For the temporal 
expression analysis, pieces of young, fully developed trifoliate 
leaves and shoot apices were bulk sampled at 4 hours after dawn 
from 4 individual plants grown under SD every five days from 10 
DAE until 25 DAE. The trifoliate leaves and shoot apices from 4 
plants of both transgenic and untransformed lines were bulk 
sampled at 4 hours after dawn at 20 DAE under the LD condition 
and stored until total RNA extraction. 

Soybean genetic transformation. The cDNA sequences of 
Harosoy GmFT2a/ 5a and GmFDL19 were first cloned into the 
pEASY-Tl vector (Transgene, Beijing, China). Xbal/SacI-digest- 
ed fragments were then inserted at multiple cloning sites in the 
pTFl 0 1 . 1 vector, and the transgenes were driven by the cauliflower 
mosaic virus 35S promoter [40]. The GmFT2a/ 5a-pTFl 01 and 
Gm.FDZ/.9-pTF101 constructs were used to transform the cultivars 
Williams 82 and Dongnong 50, respectively, following the 
cotyledonary node method [41]. TO, Tl and T2 transformants 
were screened by daubing 160 mg/L glufosinate into the prelim- 
inary leaves of the seedlings. Herbicide-resistant T2 plants were 
subjected to molecular and phenotypic analysis. 

RT-PCR and quantitative RT-PCR analyses 

Total RNA was isolated and cDNA was synthesized as 
described in Kong et al. [35]. RT-PCR of GmFDL19, GmAPl (a, 
b, c, dj, GmSOCla, GmSOClb, GmLFTl, GmLFT2 and Tubulin (as an 
internal control) was conducted using cDNAs synthesized from 
total RNA. PCR conditions were as follow: one cycle of 5 min at 
94°C; 30 cycles of 30 sec at 94°C, 30 sec at 55°C to 60°C 
(depending on the gene), and 30 sec at 72°C; and a final extension 
of 10 min at 72°C. RT-PCR was performed using homolog- 
specific primers to easily separate the RT-PCR products 
(approximately 500 bp) from the fragments amplified from 
genomic DNAs (>1 kb). The RT-PCR products were separated 
by electrophoresis in a 1 % agarose gel and visualized with EtBr 
under UV light. Quantitative RT-PCR was performed as 
described in [35]. The quantitative RT-PCR mixture was 
prepared by mixing a 1 ul aliquot of the reaction mixture from 
the cDNA synthesis, 5 ul of 1.2 uM primer premix, 10 ul SYBR 
Premix ExTaq Perfect Real Time (TaKaRa Bio), and water to a 
final volume of 20 ul. The analysis was conducted using the DNA 
Engine Opticon 2 System (Bio-Rad). The PCR cycling conditions 
were as follow: 95°C for 10 sec, 55°C to 60°C (depending on the 
gene) for 20 sec, 72°C for 20 sec, and 78°C for 2 sec. This cycle 
was repeated 40 times. Fluorescence quantification was conducted 
before and after the incubation at 78°C to monitor the formation 
of primer dimers. The mRNA level of the Tubulin gene was used as 
a control for the analysis. A reaction mixture without reverse 
transcriptase was also used as a control to confirm that no 
amplification occurred from genomic DNA contaminants in the 
RNA sample. In all of the PCR experiments, the amplification of a 
single DNA species was confirmed using both melting curve 
analysis of the quantitative PCR and gel electrophoresis of the 
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Figure 1. Overexpression of GmFT2a and GmFT5a causes 
precocious flowering in the soybean cultivar Williams 82. (A) 

The close shot of the transgenic plant in (B) shows the precocious 
flowers at the axils of the trifoliate leaves. (B) A transgenic GmFT2a plant 
showing precocious flowering at the axils of the trifoliate leaves. (C) A 
wild-type Williams 82 plant. (D) The close shot of the wild type Williams 
82 plant in (C) does not show flowers at the axils of the trifoliate leaves. 
(E) The close shot of the transgenic plant in (F) shows the precocious 
flowers at the axils of the trifoliate leaves (F) A transgenic GmFT5a plant 
showing precocious flowering at the axils of the trifoliate leaves. (G) A 
wild-type Williams 82 plant. (H) The close shot of the wild-type Williams 
82 plant in (G) does not show flowers at the axils of the trifoliate leaves. 
(I) Days to flowering from the emergence of the transgenic plants and 
wild-type plants. Averages and standard errors are calculated from four 
T2 plants for each construct and 5 Williams 82 plants. Double asterisks 
indicate significant differences from the corresponding wild-type 
Williams 82 at P<0.01. 
doi:1 0.1 371 /journal.pone.0097669.g001 

PCR products. The primers used for qRT-PCR and RT-PCR are 
listed in Table SI and Table S2, respectively. 

Identification of soybean FD, API, LFY and SOC1 
homologs. The database used for these searches is available 
at Phytozome (http://www.phytozome.net/soybean). Starting 
with the Arabidopsis FD, API, LFY and SOC 1 protein sequences, 
TBLASTN searches were conducted against the soybean (Glycine 
max) gene index (release 1.0). The top 18 .FD-like gene sequences 
producing high-scoring segment pairs were chosen and investigat- 
ed further. Primers were designed to amplify cDNAs for each of 
the top 18 iD-like genes (Table S2). Seven pairs of full-length CDS 
PCR (Table S3) primers were designed to amplify the seven 
expressed .FD-Like genes, and restriction sites (underlined) were 
included in the oligos to facilitate the cloning of the PCR products 



into the yeast vector pGADT7. A multiple sequence alignment 
and a neighbor-joining phylogenetic tree were constructed using 
DDBJ (http://clustalw.ddbj.nig.ac.jp/) online ClustalW software 
and Treeview 2.0. The tree was based on the full-length amino 
acid region including the bZIP domain and the SAP motif (Figure 
SI). The bootstrap percentage supports are indicated at the 
branches of the tree. 

Yeast two-hybridization assays 

The yeast cloning vectors pGBKT7 and pGADT7, the control 
vectors pGADT7-T and pGBKT7-53, and the yeast strain Y2H 
used in the yeast-two hybridization assays were obtained from the 
Clontech company (http://www.clontech.com/). The yeast two- 
hybridization assays were performed according to the manufac- 
turer's instructions. Soybean full-length CDS of GmFT2a and 
GmFT5a were inserted into pGBKT7 vectors to generate fused 
GAL4 DNA binding domains as the soybean baits. Full-length 
CDS of the three soybean .FZMike genes containing the SAP motif 
(GmFDLl 9, GmFDL08 and GmFDL15) were cloned into pGADT7 
to generate fused GAL4 DNA activation domains as the soybean 
preys, and the full-length CDS of Arabidopsis FD was also cloned 
into pGADT7 to generate a positive control. Table S4 lists the 
primers and restriction sites used to generate the yeast bait and 
prey constructs. The bait and prey plasmids were cotransformed 
into the yeast strain Y2H using the lithium acetate method and 
selected on SD medium lacking leucine (Leu) and tryptophan 
(Trp). After 4 days of incubation at 30°C, the yeast cells were 
replated on selection plates with SD medium lacking Leu, Trp, 
histidine (His) and adenine (Ade) but including the X-ot-gal 
substrate for the interaction test. 

Bimolecular fluorescence complementation 

The full-length CDS of GmFT2a and GmFT5a were amplified 
using the primer pairs Gm.FT.?a-NE-F/R and G»iFT5a-NE-F/R, 
respectively (Table S5), and were then inserted into the pUC_- 
SPYNE [42] vector, which contains the DNA encoding the N- 
terminus of YFP. The full-length cDNAs of GmFDLOS, GmFDL15, 
GmFDL19 and FD were amplified using the primer pairs listed in 
Table S5 and then inserted into the pUC_SPYCE [42] vector, 
which contains the DNA encoding the C-terminus of YFP. The 
recombined pUC_SPYNE/CE plasmids were cotransformed into 
Arabidopsis protoplasts using polyethylene glycol-mediated trans- 
fection, as described previously [43]. YFP-dependent fluorescence 
was detected 24 h after transfection using a confocal laser-scanning 
microscope (Zeiss LSM 510 Meta). 

Electrophoretic mobility shift assay 

The full-length coding region of GmFDL19 was amplified by 
PCR using the primer pair GmFDLl 9-29b-F/R (Table S6). The 
PCR product and the pET29b plasmid (Novagene, WL USA) 
were digested with Mdel and Sail. After ligation, the construct was 
transformed into the E. coli competent cell line BL21 (DE3) 
(Transgene, Beijing, China) according to the manufacturer's 
instructions. The recombinant GmFDLl 9 protein was purified 
using the His tag purification nickel ion system (Kangweishiji, 
Beijing, China). EMSA was conducted using the recombinant 
GmFDL19 protein and the DNA products of the GmAPla 
promoter obtained by hybridizing the forward and reverse 
complementary oligos containing the ACGT core sequence (Table 
S6). The EMSA assay was conducted using the EMSA kit 
(Invitrogen, www.Invitrogen.com, Cat #E33075). The DNA- 
protein complex samples were loaded into a TBE gradient 6% 
polyacrylamide native gel (Bio-Rad Laboratories, www.bio-rad. 
com) at 200 V for 45 minutes. The DNA in the gel was stained 
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Figure 2. Temporal and spatial expression of soybean flowering-related genes. (A) Transcript levels of eight soybean flowering-related 
genes (GmAPIa, GmAPIb, GmAPIc, GmAPId, GmSOCIa, GmSOCIb, GmLFYI, GmLFY2) in leaves and shoot apices of the soybean cultivar Harosoy 
under SD (12L/12D) conditions; Tubulin is included as an endogenous control. Samples were collected from 10 DAE to 25 DAE. DAE: days after 
emergence. L: leaves; S; shoot apex. (B) Tissue-specific expression analyses of eight flowering-related genes by RT-PCR under SD (1 2L/1 2D) conditions. 
L: leaves, S: shoot apices, F: flowers, FB: flower buds, P: pods, R: roots. 
doi:1 0.1 371 /journal.pone.0097669.g002 



using SYBR Green, provided in the same kit, and visualized using 
the GE Typhoon LFA 9500 Imaging System (GE Healthcare Life 
Science, USA). 

Results 

Overexpression of GmFT2a and GmFT5a causes 
precocious flowering in soybean 

Two FT homologs, GmFT2a and GmFTSa, are involved in 
photoperiod-regulated flowering, and these two genes coordinately 
control flowering in soybean [35] . To determine how these two FT 
homologs regulate soybean flowering, GmFT2a and GniFT5a were 
genetically transformed into the soybean cultivar Williams 82 
under the control of the cauliflower mosaic virus (CaMV) 35S 
promoter. The overexpression of GmFT2a and GmFT5a caused the 
early flowering of Williams 82 even under noninductive LD (16L/ 
8D) conditions (Figure 1A, B, C, D and Figure IE, F, G, H). The 
transgenic GmFT2a T2 overexpression line #2-1-1 flowered at 
approximately 33 days after emergence (DAE) and the transgenic 
GmFT5a T2 overexpression line #5-1 flowered at approximately 
35 DAE; however, the untransformed Williams 82 flowered at 
approximately 57 DAE (Figure II). These data suggested that both 
GmFT2a and GmFT5a are able to induce early flowering in 
soybean under noninductive LD conditions. 

GmFT2a and GmFT5a upregulate floral meristem identity 
genes 

All flowering pathways converge onto floral integrators, 
including FT and SOC1, and induce the expression of floral 
meristem identity genes, including API and LFY [44] , [45] . Several 
genes involved in the determination of flowering time have 
recently been isolated and characterized in soybean, including 
GmAPl, GmSOCl, GmSOCl-like and GmLFY [38], [39], [46]. By 
searching the soybean reference genome using Phytozome (http:// 
www.phytozome.net/soybean), four API homologs were identified 
and designated as GmAPIb (Glyma01g08150), GmAPIc (Gly- 
ma08g36380), GmAPId (Glyma02g 13420) and GmAPIa (Gly- 
mal6g 13070, GmAPl), the last of which has been characterized 
previously [39]. Spatial RT-PCR analyses suggested that GmAPIa, 
GmAPIb and GmAPIc were mainly transcribed in reproductive 
organs such as shoot apices, flower buds and flowers under SD 
(12L/12D) conditions in the cultivar Harosoy, with the transcrip- 
tion of GmAPIa being the most prominent; however, the 
transcription of GmAPId was not detected in any tissues 
(Figure 2A). Two LFY homologs, designated as GmLFYI 



(Glyma04g37900) [46] and GmLFY2 (Glyma06gl7170), were also 
identified in the soybean genome. GmLFYI was transcribed mainly 
in developing pods and seeds and was not detected in leaves or the 
SAM (shoot apex meristem) (Figure 2A), suggesting that the gene 
might contribute to seed development in soybean instead of 
flowering, as was previously reported [46]. Similarly to the API 
homologs, GmLFY2 was also transcribed in shoot apices, flower 
buds and flowers (Figure 2A). Two soybean S0C1 homologs, 
GmSOCIa (Glymal8g45780) [39] and GmSOCIb (Glyma09g40230) 
[38], could be identified from the soybean genome. Both of these 
genes were highly expressed in shoot apices, leaves, flower buds 
and roots but were weakly expressed in flowers and pods 
(Figure 2A), in agreement with the expression patterns observed 
for S0C1 in multiple organs of Arabidopsis [47]. In total, six floral 
meristem identity genes, GmAPIa, GmAPIb, GmAPIc, GmSOCIa, 
GmSOCIb and GmLFY2, were constandy expressed in the shoot 
apices of the cultivar Harosoy from 10 DAE to 25 DAE under SD 
( 1 2L/ 1 2D) conditions before the floral bud formation stage (floral 
buds formed at 25 DAE), and GmSOCIa and GmSOCIb were also 
constandy expressed in the leaves of the cultivar (Figure 2B). The 
constandy high expression levels of these genes in shoot apices 
before the floral bud formation stage most likely indicate their 
involvement in the flowering transition of soybean. 

In Arabidopsis, FT controls photoperiod-regulated flowering by 
activating the downstream flower meristem identity genes API, 
LFY and S0C1 [12]-[14], [48]-[50]. To determine whether 
GmFT2a and GmFT5a induce early flowering in soybean by 
regulating the orthologs of API, LFY and S0C1, the expression 
levels of GmAPl [a, b, c), GmSOCIa, GmSOCIb and GmLFY2 were 
determined using quantitative PCR in the leaves and shoot apices 
of the transgenic GmFT2a and GmFT5a overexpression soybean 
lines. In the transgenic GmFT2a soybean line #2-1-1, the 
expression levels of GmFT2a, GmAPl [a, b, c), GmSOCIa, GmSOCIb 
and GmLFY2 were significantly higher in the shoot apices than 
were levels in untransformed Williams 82 (Figure 3A), while the 
expression of GmFTSa remained unchanged. However, in the 
transgenic GmFT5a soybean line #5-1, the expression levels of 
GmFTSa, GmAPl [a, b, c), and GmSOCIb were significantly higher 
in the shoot apices than were levels in untransformed Williams 82 
(Figure 3B), while the expression levels of GmFT2a, GmSOCIa and 
GmLFY2 were unchanged (Figure 3B). These results suggest that 
the flowering genes GmAPls, GmSOCIa, GmSOCIb and GmLFY2 
are downstream of GmFT2a and GmFT5a and that these genes are 
most likely differentially involved in the GmFT2a and GmFT5a- 
induced early flowering of soybean. In addition, the expression of 
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Figure 3. GmFT2a and GmFT5a promote the expression of soybean flowering-related genes. (A) Expression analyses of GmFT2a, GmFT5a 
and flowering-related genes in transgenic GmFT2a plants (#2-1-1) and wild-type Williams 82 plants (WT). (B) Expression analyses of GmFT5a, GmFT2a 
and flowering-related genes in transgenic GmFT5a plants (#5-1) and wild-type Williams 82 plants (WT). The white and black columns represent 
relative expression in leaves and shoot apices, respectively. Asterisks and double asterisks indicate significant differences between transgenic and WT 
plants at 0.01<P<0.05 and P<0.01, respectively. 
doi:1 0.1 371 /journal.pone.0097669.g003 
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Figure 4. Seven GmFDLs (soybean F/>like genes) transcribed in leaves and shoot apices. (A) Transcript levels of seven GmFDLs in leaves 
and shoot apices of the soybean cultivar Harosoy under SD (12L/12D) conditions; Tubulin is included as an endogenous control. Samples were 
collected from 1 0 DAE to 25 DAE. L: leaves; S: shoot apices. (B) Multiple alignment of the amino acid sequences in the SAP motif region of the FDs 
from soybean and other species. The SAP motif is a putative sequence for FT binding. 
doi:1 0.1 371 /journal.pone.0097669.g004 



GmFT2a in the GmFT5a transgenic line and the expression of 
GmFT5a in the GmFT2a transgenic line were both unchanged in 
the leaves and shot apices, suggesting that GmFT2a and GmFT5a 
promote soybean flowering in a redundant manner (Figure 3). 

GmFT2a and GmFT5a interact with GmFDL19 

FT and its homologs are widely understood to move from leaves 
to shoot apices, where they interact with FDs to form FT/FD 
complexes that bind to the promoter of API [13], [18], [19]. In 
this study, we found that GmFT2a and GmFT5a promote 
flowering by inducing the expression of GmAPh, GmSCOls and 
GmLFT2, and it is easily assumed that GmFTs also require a 
partner such as GmFD to regulate the downstream flowering 
genes in soybean. Taking the amino acid sequence of FD from 
Arabidopsis as the query, we searched for orthologs of FD in the 
soybean genome using Phytozome and identified 18 high-scoring 
candidate GmFDLs (GmFD-Like) genes (Figure SI). Expression 
analyses were conducted using RT-PCR for all eighteen selected 
GmFDLs, of which seven GmFDLs were transcribed both in leaves 
and shoot apices (Figure 4A). A multiple sequence alignment of the 
seven soybean FD-like proteins with the FDs from other species 
shows a conserved bZIP domain of 42 amino acids (N-X7-R-X9- 
L-X6-L-X6-L) (Figure S2) and an SAP (RXXS/TAP) motif 
(Figure 4B), SAP motif has been reported as a putative binding 
sequence for FT [13]. Phylogenetic analysis was conducted based 
on the amino acid sequences of the 1 8 candidate GmFDLs, and 
the proteins were grouped into three clades (Figure SI, Table S7). 
GmFDL02, GmFDL04 and GmFDL0602 were divided into the 
FD clade, but these three GmFDLs did not transcribe or share an 
SAP motif. GmFDL08, GmFDL13, GmFDL15, GmFDL19 and 
GmFDL20 were divided into the wheat TaFDL2 clade, with 
GmFDL08, GmFDL15 and GmFDL19 sharing the classic SAP 
motif; these three GmFDLs were therefore tested for interactions 
with soybean GmFT2a and GmFT5a. GmFDL06 and GmFDL12 
were divided into the AREB and ABI5 cluster; these two proteins 
may represent the stress-related bZIP transcription factors in 
soybean. 

GmFT2a and GmFT5a promoted early flowering in Arabi- 
dopsis [35], [36], so we first examined the interactions of FD with 
GmFT2a and GmFT5a using the yeast two-hybridization assay. 
The results indicated that both GmFT2a and GmFT5a were able 



to interact with FD (Figure 5A). Autoactivation tests of GmFT2a 
and GmFT5a in yeast confirmed that both proteins were unable to 
activate the reporter genes when used alone as bait (data not 
presented). The yeast two-hybridization assays of the three SAP 
motif proteins among the GmFDLs, GmFDL08, GmFDL15 and 
GmFDL19, with GmFT2a and GmFT5a revealed that only 
GmFDL19 was able to interact with GmFT2a and GmFT5a 
(Figure 5A). To validate the results of the yeast two-hybridization 
tests, in vivo BiFC analyses were conducted. These results 
confirmed that both FD and GmFDL19 could interact with 
GmFT2a and GmFT5a in the nuclei of Arabidopsis protoplasts 
(Figure 5B). Our results suggest that GmFT2a and GmFT5a 
promote early flowering in soybean, most likely through interact- 
ing with GmFDL19 and upregulating downstream floral identity 
genes in a manner similar to that of FT in Arabidopsis. GmFDLl 9 
was constantly transcribed both in leaves and shoot apices, with 
increasing expression levels in the shoot apices following the 
growth stage from 10 DAE to 25 DAE before floral bud formation 
(25 DAE) (Figure 4A). In addition, the expression of GmFDLl 9 was 
upregulated by GmFT2a in the transgenic overexpression line 
#2- 1 - 1 , while the transcription of GmFDLl 9 was unchanged in the 
transgenic GmFT5a overexpression line #5-1 (Figure 3A, B). The 
expression patterns and protein interactions of GmFDL 1 9 strongly 
support this protein as the candidate soybean FD ortholog, which 
may participate differentially in the early flowering of soybean 
promoted by GmFT2a and GmFT5a. That is, GmFT2a promotes 
soybean early flowering through both transcriptional upregulation 
of and physical interaction with GmFDL19, but GmFT5a only 
promotes flowering through physical interaction with the protein. 

Overexpression of GmFDLl 9 causes early flowering in 
soybean 

The expression patterns and protein interactions of GmFDLl 9 
suggest that the protein might be involved in the GmFT2a- and 
GmFTSa-regulated flowering pathway in soybean. To determine 
the functions of GmFDL 1 9 in soybean flowering, overexpression 
of GmFDL19 driven by the 35S promoter was genetically 
transformed into the cultivar Dongnong 50 (Figure 6A, B). Under 
LD (16L/8D) conditions, this cultivar flowered early, at approx- 
imately 30 DAE. To observe clearer flowering differences between 
the transgenic T2 line #12-1 and untransformed Dongnong 50, 
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Figure 5. Interactions of GmFDLs with GmFT2a and GmFT5a. 

(A) Yeast two-hybridization assays; FD was also included because both 
GmFT2a and GmFT5a promoted early flowering in Arabidopsis. After 
cotransformation of the baits and preys, an equal amount of yeast 
clones were plated on SD-Leu-Trp and SD-Leu-Trp-His-Ade+X-cx-gal 
selective plates, and the plates were incubated at 30 C until the 
emergence of the yeast clones. (B) BiFC (bimolecular fluorescence 
complementation) assays to confirm the results of the yeast two- 
hybridization assays. Arabidopsis protoplasts cotransformed with 
constructs of FTs or FDs fused to the N-terminal (YN) and C-terminal 
(YC) halves of YFP, respectively (as indicated), were imaged using a 
confocal microscope after incubation at room temperature (20°C to 
25°C) over 18 hours. Images are shown as YFP, merged YFP and bright 
field. Scale bars indicate 20 |im. 
doi:1 0.1 371 /journal.pone.0097669.g005 

we evaluated the flowering times of both lines under longer 
photoperiod LD (18L/6D) conditions. Under these conditions, the 
transgenic line #12-1 flowered, on average, at approximately 43 
DAE, while Dongnong 50 flowered at 55 DAE, indicating that 
GmFDL19 is able to promote flowering in soybean (Figure 6A, B, 
C, D, E). 

We have demonstrated that GmFT2a and GmFTSa promote 
flowering by inducing the expression of flowering-related genes 
and that GmFDL19 transgenic plants flower earlier than untrans- 
formed Dongnong 50 plants under an LD photoperiod. We next 
wished to determine whether these flowering-related genes were 
also upregulated in the shoot apices of GmFDL19 transgenic plants. 
As expected, the transcription levels of GmFDL19, as well as those 
of the flowering related genes GmAPh, GmSOCh and GmLFT2, 
were significantly higher in the shoot apices of the transgenic 
GmFDL19 line #12-1 than in those of untransformed Dongnong 
50 (Figure 6F). However, the expression levels of GmFT2a and 
GmFT5a in the transgenic GmFDL19 line #12-1 were unchanged 
and were only faintly detected in both #12-1 and Dongnong 50 



under the LD (18L/6D) conditions (Figure 6F). Considering the 
upregulation of GmFDL19 by GmFT2a, as well as the fact that 
GmFDL19 has the same effect on the upregulation of GmAPh, 
GmSOCls and GmLFT2 as do GmFT2a, the results suggest that 
GmFDL19 may act downstream of GmFT2a in the regulation of 
the flowering transition in soybean. In addition to the transcrip- 
tional upregulation of GmFDLlQ by GmFT2a, GmFDL19 
interacts with both GmFT2a and GmFT5a, suggesting that 
GmFDL19 may be required for GmFT2a- and GmFT5a- 
regulated flowering in soybean. 

GmFDL19 binds to the GmAPla promoter in vitro 

FT-like proteins interact with FD-like proteins to form FT/FD 
complexes, which bind to the core ACGT cis-elements located at 
the promoters of ^LPi -like genes in Arabidopsis, rice and wheat 
[14], [18], [19]. To determine whether this mechanism is 
conserved in soybean, we conducted an electrophoretic mobility 
shift assay (EMSA) to test the binding of GmFDL19 with the 
promoter of GmAPla. GmAPla was selected for the binding assay 
because it showed a higher expression level than did the other two 
homologs, GmAPlb and GmAPlc (Figure 2, Figure 3 and Figure 6F). 
This gene contains seven ACGT core elements and one CArG 
box, representing the putative binding site for MADS domain 
transcription factors in its promoter (Figure 7). The EMSA results 
demonstrated that GmFD19 binds to the ACGT core sequences in 
vitro. As the negative control, the CArG box could not be bound by 
GmFDL19 (Figure 7). These results suggest that the FT/FD-AP1 
pathway is well conserved in soybean and that GmFDL19 serves 
as an important component of GmFT2a- and GmFTSa-regulated 
flowering in the legume. 

Discussion 

Photoperiod-regulated flowering is integrated through 
GmFT2a and GmFT5a in soybean 

Soybean is adapted to wide range of latitudes, from at least 
North 50° to South 35°, and its wide adaptability can most likely 
be attributed to the genetic diversity at many of the major genes 
(El to E8 and J) and unclassified quantitative trait loci controlling 
its flowering and maturity. Of the major maturity genes, El to E4 
delay flowering and maturity under LD but have no effects on 
flowering and maturity under SD [51]. The molecular identities of 
El to E4 have recently been characterized [52]. The El gene has 
the largest effect of the maturity genes on delaying flowering and 
maturity under LD conditions [29], [33], [51], [53] and has been 
cloned using a map-based approach, revealing it to be a legume- 
specific transcription repressor with a putative nuclear localization 
signal (NLS) and a B3 distantly related domain [29]. The 
repression of flowering by E 1 is most likely due to its suppression 
of the transcription of GmFT2a and GmFTSa under LD conditions 
[29], [36]. El is transcribed mainly in vegetative organs such as 
cotyledons and leaves [29], which is consistent with the 
transcription sites of GmFT2a and GmFT5a [35]. The diurnal 
circadian rhythm of El transcription contains two peaks in the 
leaves within 24 hours [29], of which the first transcription peak 
overlaps with the transcription peaks of GmFT2a and GmFT5a at 
4 hours after dawn [35], These results suggest that GmFT2a and 
GmFT5a are most likely the direct targets of E 1 regulation, but this 
hypothesis requires further evidence for verification. E3 and E4 
encode the light receptor phytochrome A (PHYA) proteins 
GmPHYA3 and GmPHYA2, respectively [31], [32]. The 
expression levels of GmFT2a and GmFT5a were additively 
suppressed by E3 and E4 [35], perhaps indirectly via El function, 
as shown in genetic studies revealing that E3 and E4 have epistatic 



PLOS ONE | www.plosone.org 



7 



May 2014 | Volume 9 | Issue 5 | e97669 



Soybean FT Proteins Interact with GmFDL19 to Induce Flowering 




#12-1 Dongnong 50 




1 i 1 U i I U.U i I 

WT #12-1 WT #12-1 WT #12-1 



Figure 6. Overexpression of GmFDL 19 in the soybean cultivar Dongnong 50 causes early flowering. (A) The close shot of the transgenic 
plant in (B) shows the precocious flowers at the axils of the trifoliate leaves (B) A transgenic GmFDL19 plant showing precocious flowering at the axils 
of the trifoliate leaves. (C) A wild-type Dongnong 50 plant. (D) The close shot of the wild-type Dongnong 50 plant in (C) does not show flowers at the 
axils of the trifoliate leaves. (E) Number of days to flowering in transgenic and wild-type plants. Averages and standard errors are calculated from five 
independent T2 plants and five Dongnong 50 plants. (F) Expression analyses of GmFDL19 and flowering-related genes in transgenic GmFDL19 plants 
(#12-1) and wild-type Dongnong 50 plants (WT); because these flowering related genes are transcribed mostly in shoot apices, shoot apex samples 
were collected from transgenic and wild-type Dongnong 50 plants at 40 DAE under LD (18L/6D) conditions. Asterisks and double asterisks indicate 
significant differences from the corresponding wild-type Dongnong 50 at 0.01<P<0.05 and P<0.01, respectively. 
doi:1 0.1 371 /journal.pone.0097669.g006 



effects on El under LD conditions [29]. E2 was identified 
molecularly as an ortholog of the Arabidopsis GIGAJVTEA gene 
[30]. The E2 gene mainly controls flowering time through the 
regulation of GmFT2a, not GmFT5a [30]. Taken together, these 
results indicate that the photoperiod-regulated flowering pathway 
in soybean converges at GmFT2a and GmFT5a. 



GmFT2a and GmFT5a redundantly and differentially 
regulate flowering in soybean 

An SD-to-LD transfer experiment demonstrated the differences 
in photoperiod response between GmFT2a and GmFT5a. The 
expression of GmFT2a was strictiy regulated by photoperiodic 
changes from SD to LD, whereas the response of GmFT5a to 
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Figure 7. The GmFDL19 protein specifically binds with the ACGT core sequence in vitro. Potential bZIP binding sites presented in the 
GmAPla promoter were used as probes in binding reactions with the purified recombinant GmFDL19 protein. The eight probes included seven 
potential bZIP binding sites and a CArG-box as negative control. (1 ) T-box (A ACGTT ), (2) A/C hybrid of A and C-box (T ACGT C), (3) T/C hybrid of T and 
C-box (AACGTC), (4) CArG-box (CCNNNNNNNNGG), (5) G/A hybrid of G and A-box (C ACGT A), (6) T/A hybrid of T and A-box (A ACGT A), (7) G/A hybrid 
of G and A-box (CACGTA), (8) T-box (AACGTT). The scheme below indicates the positions of the various bZIP binding sites. 
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photoperiodic changes was gradual, and its expression was 
maintained at low levels even after the plants were transferred to 
LD [35]. These findings suggest that, in addition to the phyA- 
mediated photoperiod response, a second regulatory mechanism 
may be involved in the differences of expression pattern between 
GmFT2a and GmFTSa. In addition to E3 and E4, E2 influences the 
mRNA abundance of i^Thomologs. Watanabe et al. (201 1) found 
a clear association between flowering time and GmFT2a expression 
in two sets of near isogenic lines (NILs) for the E2 locus [30]; 
dysfunctional e2 alleles promoted GmFT2a expression and 
conditioned earlier flowering. However, these authors did not 
observe significant differences in GmFTSa expression between the 
NILs. These results suggest that the E2 gene (GmGIa) mainly 
controls flowering time through the regulation of GmFT2a [30]. 
The different responses to photoperiodic changes observed 
between GmFT2a and GmFT5a [35] may thus be caused by the 
involvement of the GI (E2)-regulated pathway in GmFT2a 
expression, but not in GmFT5a expression. Under the phyA (E3 
and E4)-mediated photoperiodic regulation system, GmFT2a and 
GmFT5a may redundantly and strongly induce flowering under 
shorter day lengths, but GmFT5a alone may promote flowering in 
a photoperiod-independent manner under longer day lengths. 

The expression analyses of GmFT2a and GmFT5a in their 
respective transgenic overexpression lines further demonstrate that 
GmFT2a and GmFT5a are not regulated by each other, 
suggesting that GmFT2a and GmFTSa induce soybean flowering 
redundantly (Figure 3A, B). GmFT2a significantly upregulates 
downstream floral identity genes such as GmAPl [a, b, c), GmSOCla, 
GmSOClb and GmLFT2. However, GmFT5a only upregulates 
GmAPl (a, b, c) and GmSOClb and has no effect on GmSOCla and 
GmLFY2 (Figure 3A, B). In addition, the expression of GmFDL19 
was upregulated by GmFT2a in the transgenic overexpression line 
#2-1-1 while the transcription of the gene was unchanged in the 
transgenic GmFTSa overexpression line #5-1 (Figure 3A, B). A 
hypothesis for the system was developed: GmFT2a, in combina- 
tion with GmFDL19, triggers the upregulation of GmLFY2, and 
GmLFY2 then feeds back directly to further upregulate GmFDL19. 
However, this hypothesis requires further confirmation. These 
results suggest that GmFT2a and GmFT5a induce soybean 
flowering differentially and redundantly. The GmFT2a-regulated 
flowering pathway and the GmFTSa-regulated flowering pathway 



may be integrated in the SAM and are redundantly balanced in a 
very complex manner to ensure precise flowering in paleopoly- 
ploid soybean. These two FT homologs may therefore coordi- 
nately and redundantly control flowering in soybean. 

GmFDL19 may be involved in GmFT2a- and 
GmFT5a-regulated flowering in soybean 

FT and its homologs are widely known to move from leaves to 
shoot apices, where they interact with FDs to form FT/FD 
complexes that bind to the promoter of API and induce flowering 
in many plant species [13], [18], [19]. In this study, we report that 
the bZIP transcription factor GmFDL19 is able to physically 
interact with two soybean FT homologs, GmFT2a and GmFT5a, 
as confirmed by both yeast two-hybridization in vitro and BiFC in 
vivo. The binding of GmFDL19 with the cis-elements in the 
promoter of the API soybean ortholog GmAPla was further 
confirmed by EMSA in vitro. Our results further extend the 
regulatory module of FT/FD-AP1 in the legume species soybean. 
In addition to the interaction of GmFDL19 with GmFT2a and 
GmFTSa, GmFT2a is able to upregulate the transcription of 
GmFDL19 in shoot apices. The transcription levels of GmFT2a 
and GmFT5a are not regulated by GmFDL19, suggesting that 
GmFDL19 functions downstream of GmFT2a and GmFTSa. The 
floral identity genes GmAPl [a, b, c), GmSOCla, GmSOClb and 
GmLFT2 are upregulated by GmFDL19, GmFT2a and GmFT5a 
in their respective transgenic overexpression lines. Taken together, 
these results suggest that GmFDL19 may be involved in GmFT2a- 
and GmFTSa-regulated flowering in soybean. 

In summary, we propose a molecular network of photoperiod- 
regulated flowering in soybean (Figure 8). Under SD conditions, 
the E3, E4 and El genes do not express in leaves where GmFT2a 
and GmFT5a are able to transcribe at high levels, and the 
GmFT2a and GmFT5a proteins are then transported from the 
leaves to the shoot apices, where they bind with GmFDL19 to 
induce the expression of flowering-related genes [GmAPl, 
GmSOCl, GmLFY), thus leading to early flowering. Under LD 
conditions, E3 and E4 genes are highly transcribed in leaves, 
where they epistatically induce the high expression of the El gene, 
thereby suppressing the expression levels of GmFT2a and GmFTSa. 
The expression levels of flowering related genes are not 
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Figure 8. Proposed mechanism of photoperiod-regulated flowering controlled by GmFT2a and GmFT5a in soybean. (A) Model of 
GmFT2a and GmFT5a regulating the expression of GmAPIa. The horizontal dotted line represents the GmAPIa promoter, and the black vertical bars 
indicate the eight exons of GmAPIa. The green oval represents the GmFDL19 protein, and this protein can bind to the T-box, G-box or hybrid box 
(white rectangle) in the GmAPIa promoter. The orange oval represents the interactions of GmFT2a and GmFT5a with GmFDL19. (B) A proposed 
molecular network for photoperiod-regulated flowering in soybean. 
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upregulated due to lack of FT proteins, leading to a late-flowering 
phenotype. 

Supporting Information 

Figure SI Phylogenetic relationship of soybean FD-like 
proteins and FDs from other species constructed using 
the neighbor-joining method with the program CLUS- 

TAL W. Bootstrap percentage supports are indicated at the 
branches of the tree. The seven red filled rectangles indicate the 
bZIP domain of seven expressed .FD-like genes in soybean, and the 
red rectangles indicated the SAP motif contained in soybean FD- 
like proteins and FDs from other species. The locus IDs or 
accession numbers of these FDs are presented in Table S7. 
(TIF) 

Figure S2 Conserved bZIP domain of the seven soybean 
FD-like proteins and FDs from other species. 

(TIF) 



Table SI Primers for qRT-PCR analysis. 

(PDF) 

Table S2 Primers for RT-PCR analysis. 

(PDF) 

Table S3 Primers for isolation of seven GmFDLs. 

(PDF) 

Table S4 Primers for yeast two-hybridization assays. 

(PDF) 

Table S5 Primers for BiFC. 

(PDF) 

Table S6 Primers for EMSA. 

(PDF) 

Table S7 List of FD homologs contained in the phylo- 
genetic analysis used in the present study. 

(PDF) 
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